Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 58000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 4.4 MiB |
| Average record size in memory | 80.0 B |
Variable types
| NUM | 10 |
|---|
Reproduction
| Analysis started | 2020-08-25 01:51:36.003331 |
|---|---|
| Analysis finished | 2020-08-25 01:51:53.993236 |
| Duration | 17.99 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
A8 is highly correlated with A5 | High correlation |
A5 is highly correlated with A8 | High correlation |
A4 is highly skewed (γ1 = 31.68785333) | Skewed |
A6 is highly skewed (γ1 = -21.86181042) | Skewed |
A2 has 35878 (61.9%) zeros | Zeros |
A4 has 38055 (65.6%) zeros | Zeros |
A5 has 706 (1.2%) zeros | Zeros |
A6 has 18420 (31.8%) zeros | Zeros |
A9 has 20134 (34.7%) zeros | Zeros |
A1
Real number (ℝ≥0)
| Distinct count | 76 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.23829310344828 |
|---|---|
| Minimum | 27.0 |
| Maximum | 126.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | 27 |
|---|---|
| 5-th percentile | 37 |
| Q1 | 38 |
| median | 45 |
| Q3 | 55 |
| 95-th percentile | 79 |
| Maximum | 126 |
| Range | 99 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 12.23808169 |
|---|---|
| Coefficient of variation (CV) | 0.2537005541 |
| Kurtosis | 6.50879166 |
| Mean | 48.2382931 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 2.180838482 |
| Sum | 2797821 |
| Variance | 149.7706435 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 37 | 13308 | 22.9% | |
| 55 | 5949 | 10.3% | |
| 56 | 5658 | 9.8% | |
| 41 | 3299 | 5.7% | |
| 45 | 3251 | 5.6% | |
| 44 | 2596 | 4.5% | |
| 49 | 2439 | 4.2% | |
| 43 | 2438 | 4.2% | |
| 46 | 2302 | 4.0% | |
| 51 | 1802 | 3.1% | |
| 53 | 1700 | 2.9% | |
| 42 | 1379 | 2.4% | |
| 38 | 1285 | 2.2% | |
| 47 | 1175 | 2.0% | |
| 48 | 1150 | 2.0% | |
| 39 | 923 | 1.6% | |
| 50 | 840 | 1.4% | |
| 52 | 782 | 1.3% | |
| 40 | 622 | 1.1% | |
| 54 | 424 | 0.7% | |
| 81 | 382 | 0.7% | |
| 58 | 340 | 0.6% | |
| 82 | 326 | 0.6% | |
| 57 | 319 | 0.5% | |
| 79 | 316 | 0.5% | |
| Other values (51) | 2995 | 5.2% |
| Value | Count | Frequency (%) | |
| 27 | 3 | < 0.1% | |
| 36 | 61 | 0.1% | |
| 37 | 13308 | 22.9% | |
| 38 | 1285 | 2.2% | |
| 39 | 923 | 1.6% | |
| 40 | 622 | 1.1% | |
| 41 | 3299 | 5.7% | |
| 42 | 1379 | 2.4% | |
| 43 | 2438 | 4.2% | |
| 44 | 2596 | 4.5% |
| Value | Count | Frequency (%) | |
| 126 | 1 | < 0.1% | |
| 123 | 8 | < 0.1% | |
| 121 | 1 | < 0.1% | |
| 120 | 2 | < 0.1% | |
| 116 | 3 | < 0.1% | |
| 114 | 1 | < 0.1% | |
| 111 | 1 | < 0.1% | |
| 108 | 28 | < 0.1% | |
| 107 | 72 | 0.1% | |
| 106 | 87 | 0.1% |
| Distinct count | 206 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.019448275862068966 |
|---|---|
| Minimum | -4821.0 |
| Maximum | 5075.0 |
| Zeros | 35878 |
| Zeros (%) | 61.9% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | -4821 |
|---|---|
| 5-th percentile | -4 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 4 |
| Maximum | 5075 |
| Range | 9896 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 77.95803508 |
|---|---|
| Coefficient of variation (CV) | -4008.480527 |
| Kurtosis | 2647.317448 |
| Mean | -0.01944827586 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.438145983 |
| Sum | -1128 |
| Variance | 6077.455233 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 35878 | 61.9% | |
| -1 | 3828 | 6.6% | |
| 1 | 3713 | 6.4% | |
| 2 | 2225 | 3.8% | |
| -2 | 2210 | 3.8% | |
| 3 | 2023 | 3.5% | |
| 4 | 1692 | 2.9% | |
| 5 | 1556 | 2.7% | |
| -3 | 1520 | 2.6% | |
| -4 | 1386 | 2.4% | |
| -5 | 1247 | 2.1% | |
| 6 | 59 | 0.1% | |
| -6 | 54 | 0.1% | |
| -8 | 32 | 0.1% | |
| 7 | 24 | < 0.1% | |
| -7 | 22 | < 0.1% | |
| 8 | 19 | < 0.1% | |
| -11 | 16 | < 0.1% | |
| 9 | 16 | < 0.1% | |
| -10 | 15 | < 0.1% | |
| -9 | 14 | < 0.1% | |
| -13 | 11 | < 0.1% | |
| -42 | 11 | < 0.1% | |
| -27 | 11 | < 0.1% | |
| -33 | 10 | < 0.1% | |
| Other values (181) | 408 | 0.7% |
| Value | Count | Frequency (%) | |
| -4821 | 1 | < 0.1% | |
| -4624 | 1 | < 0.1% | |
| -4475 | 1 | < 0.1% | |
| -4184 | 1 | < 0.1% | |
| -4048 | 1 | < 0.1% | |
| -3700 | 1 | < 0.1% | |
| -3161 | 1 | < 0.1% | |
| -2544 | 1 | < 0.1% | |
| -2460 | 1 | < 0.1% | |
| -1865 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5075 | 1 | < 0.1% | |
| 4903 | 1 | < 0.1% | |
| 4692 | 1 | < 0.1% | |
| 4501 | 1 | < 0.1% | |
| 4400 | 1 | < 0.1% | |
| 4254 | 1 | < 0.1% | |
| 3447 | 1 | < 0.1% | |
| 3328 | 1 | < 0.1% | |
| 3049 | 1 | < 0.1% | |
| 2561 | 1 | < 0.1% |
A3
Real number (ℝ≥0)
| Distinct count | 51 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 85.34912068965517 |
|---|---|
| Minimum | 21.0 |
| Maximum | 149.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 76 |
| Q1 | 79 |
| median | 83 |
| Q3 | 89 |
| 95-th percentile | 106 |
| Maximum | 149 |
| Range | 128 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 8.902768762 |
|---|---|
| Coefficient of variation (CV) | 0.1043100232 |
| Kurtosis | 0.543597328 |
| Mean | 85.34912069 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 1.096128712 |
| Sum | 4950249 |
| Variance | 79.25929163 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 77 | 6100 | 10.5% | |
| 81 | 4860 | 8.4% | |
| 79 | 4631 | 8.0% | |
| 86 | 4389 | 7.6% | |
| 76 | 3935 | 6.8% | |
| 83 | 3402 | 5.9% | |
| 78 | 2898 | 5.0% | |
| 84 | 2392 | 4.1% | |
| 80 | 2381 | 4.1% | |
| 88 | 2319 | 4.0% | |
| 82 | 2151 | 3.7% | |
| 97 | 1720 | 3.0% | |
| 95 | 1695 | 2.9% | |
| 87 | 1541 | 2.7% | |
| 96 | 1231 | 2.1% | |
| 85 | 1230 | 2.1% | |
| 106 | 1153 | 2.0% | |
| 75 | 961 | 1.7% | |
| 93 | 806 | 1.4% | |
| 90 | 790 | 1.4% | |
| 92 | 758 | 1.3% | |
| 108 | 749 | 1.3% | |
| 98 | 635 | 1.1% | |
| 89 | 625 | 1.1% | |
| 104 | 623 | 1.1% | |
| Other values (26) | 4025 | 6.9% |
| Value | Count | Frequency (%) | |
| 21 | 1 | < 0.1% | |
| 29 | 2 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 44 | 1 | < 0.1% | |
| 64 | 2 | < 0.1% | |
| 71 | 15 | < 0.1% | |
| 72 | 26 | < 0.1% | |
| 73 | 14 | < 0.1% | |
| 74 | 183 | 0.3% | |
| 75 | 961 | 1.7% |
| Value | Count | Frequency (%) | |
| 149 | 1 | < 0.1% | |
| 141 | 1 | < 0.1% | |
| 118 | 1 | < 0.1% | |
| 113 | 60 | 0.1% | |
| 112 | 49 | 0.1% | |
| 111 | 136 | 0.2% | |
| 110 | 94 | 0.2% | |
| 109 | 405 | 0.7% | |
| 108 | 749 | 1.3% | |
| 107 | 558 | 1.0% |
| Distinct count | 137 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.25967241379310346 |
|---|---|
| Minimum | -3939.0 |
| Maximum | 3830.0 |
| Zeros | 38055 |
| Zeros (%) | 65.6% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | -3939 |
|---|---|
| 5-th percentile | -4 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 5 |
| Maximum | 3830 |
| Range | 7769 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 36.5215156 |
|---|---|
| Coefficient of variation (CV) | 140.6445724 |
| Kurtosis | 7698.224846 |
| Mean | 0.2596724138 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 31.68785333 |
| Sum | 15061 |
| Variance | 1333.821102 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 38055 | 65.6% | |
| -1 | 2886 | 5.0% | |
| 1 | 2255 | 3.9% | |
| -2 | 2059 | 3.5% | |
| 2 | 1672 | 2.9% | |
| -3 | 1593 | 2.7% | |
| 3 | 1277 | 2.2% | |
| -4 | 1086 | 1.9% | |
| 4 | 1038 | 1.8% | |
| 5 | 964 | 1.7% | |
| 6 | 890 | 1.5% | |
| -6 | 885 | 1.5% | |
| -5 | 845 | 1.5% | |
| -7 | 792 | 1.4% | |
| 8 | 762 | 1.3% | |
| 7 | 523 | 0.9% | |
| -8 | 58 | 0.1% | |
| 9 | 26 | < 0.1% | |
| -9 | 18 | < 0.1% | |
| -10 | 17 | < 0.1% | |
| -12 | 16 | < 0.1% | |
| -11 | 16 | < 0.1% | |
| 10 | 15 | < 0.1% | |
| 13 | 13 | < 0.1% | |
| -13 | 13 | < 0.1% | |
| Other values (112) | 226 | 0.4% |
| Value | Count | Frequency (%) | |
| -3939 | 1 | < 0.1% | |
| -2044 | 1 | < 0.1% | |
| -1108 | 1 | < 0.1% | |
| -674 | 1 | < 0.1% | |
| -587 | 1 | < 0.1% | |
| -495 | 1 | < 0.1% | |
| -478 | 1 | < 0.1% | |
| -362 | 1 | < 0.1% | |
| -318 | 1 | < 0.1% | |
| -273 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3830 | 1 | < 0.1% | |
| 3743 | 1 | < 0.1% | |
| 2674 | 1 | < 0.1% | |
| 2565 | 1 | < 0.1% | |
| 2006 | 1 | < 0.1% | |
| 1751 | 1 | < 0.1% | |
| 1167 | 1 | < 0.1% | |
| 769 | 1 | < 0.1% | |
| 737 | 1 | < 0.1% | |
| 692 | 1 | < 0.1% |
| Distinct count | 54 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34.54986206896552 |
|---|---|
| Minimum | -188.0 |
| Maximum | 436.0 |
| Zeros | 706 |
| Zeros (%) | 1.2% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | -188 |
|---|---|
| 5-th percentile | -10 |
| Q1 | 26 |
| median | 42 |
| Q3 | 46 |
| 95-th percentile | 56 |
| Maximum | 436 |
| Range | 624 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 21.66013857 |
|---|---|
| Coefficient of variation (CV) | 0.6269240243 |
| Kurtosis | 8.547202399 |
| Mean | 34.54986207 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -1.162427473 |
| Sum | 2003892 |
| Variance | 469.1616028 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 46 | 6705 | 11.6% | |
| 42 | 6240 | 10.8% | |
| 44 | 5947 | 10.3% | |
| 38 | 4567 | 7.9% | |
| 50 | 4162 | 7.2% | |
| 54 | 3205 | 5.5% | |
| 52 | 3084 | 5.3% | |
| 36 | 2257 | 3.9% | |
| 56 | 1733 | 3.0% | |
| 34 | 1572 | 2.7% | |
| 28 | 1418 | 2.4% | |
| 26 | 1222 | 2.1% | |
| 24 | 1182 | 2.0% | |
| 20 | 1177 | 2.0% | |
| 30 | 1033 | 1.8% | |
| 18 | 934 | 1.6% | |
| 16 | 818 | 1.4% | |
| 8 | 815 | 1.4% | |
| 10 | 809 | 1.4% | |
| 12 | 762 | 1.3% | |
| -2 | 710 | 1.2% | |
| 6 | 709 | 1.2% | |
| 0 | 706 | 1.2% | |
| -4 | 691 | 1.2% | |
| 70 | 533 | 0.9% | |
| Other values (29) | 5009 | 8.6% |
| Value | Count | Frequency (%) | |
| -188 | 4 | < 0.1% | |
| -160 | 1 | < 0.1% | |
| -100 | 2 | < 0.1% | |
| -46 | 66 | 0.1% | |
| -42 | 326 | 0.6% | |
| -40 | 464 | 0.8% | |
| -38 | 71 | 0.1% | |
| -36 | 67 | 0.1% | |
| -32 | 68 | 0.1% | |
| -30 | 78 | 0.1% |
| Value | Count | Frequency (%) | |
| 436 | 2 | < 0.1% | |
| 336 | 2 | < 0.1% | |
| 310 | 1 | < 0.1% | |
| 98 | 1 | < 0.1% | |
| 72 | 357 | 0.6% | |
| 70 | 533 | 0.9% | |
| 68 | 89 | 0.2% | |
| 64 | 96 | 0.2% | |
| 62 | 154 | 0.3% | |
| 60 | 223 | 0.4% |
| Distinct count | 299 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6081896551724137 |
|---|---|
| Minimum | -26739.0 |
| Maximum | 15164.0 |
| Zeros | 18420 |
| Zeros (%) | 31.8% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | -26739 |
|---|---|
| 5-th percentile | -23 |
| Q1 | -5 |
| median | 0 |
| Q3 | 5 |
| 95-th percentile | 24 |
| Maximum | 15164 |
| Range | 41903 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 217.5976752 |
|---|---|
| Coefficient of variation (CV) | 135.3059787 |
| Kurtosis | 5979.233609 |
| Mean | 1.608189655 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -21.86181042 |
| Sum | 93275 |
| Variance | 47348.74824 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 18420 | 31.8% | |
| -1 | 1333 | 2.3% | |
| 1 | 1316 | 2.3% | |
| -4 | 1236 | 2.1% | |
| -3 | 1193 | 2.1% | |
| -2 | 1161 | 2.0% | |
| -6 | 1122 | 1.9% | |
| 6 | 1092 | 1.9% | |
| 3 | 1078 | 1.9% | |
| 5 | 1077 | 1.9% | |
| 2 | 1073 | 1.8% | |
| 4 | 1026 | 1.8% | |
| -5 | 1001 | 1.7% | |
| 7 | 877 | 1.5% | |
| -7 | 840 | 1.4% | |
| 8 | 809 | 1.4% | |
| 9 | 794 | 1.4% | |
| -8 | 783 | 1.4% | |
| 11 | 730 | 1.3% | |
| 10 | 728 | 1.3% | |
| -9 | 703 | 1.2% | |
| -10 | 694 | 1.2% | |
| -11 | 691 | 1.2% | |
| 12 | 681 | 1.2% | |
| 14 | 672 | 1.2% | |
| Other values (274) | 16870 | 29.1% |
| Value | Count | Frequency (%) | |
| -26739 | 1 | < 0.1% | |
| -13839 | 1 | < 0.1% | |
| -12809 | 1 | < 0.1% | |
| -11042 | 1 | < 0.1% | |
| -10453 | 1 | < 0.1% | |
| -8392 | 1 | < 0.1% | |
| -4141 | 1 | < 0.1% | |
| -2944 | 1 | < 0.1% | |
| -2385 | 1 | < 0.1% | |
| -2377 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 15164 | 1 | < 0.1% | |
| 13148 | 1 | < 0.1% | |
| 12588 | 1 | < 0.1% | |
| 12169 | 1 | < 0.1% | |
| 11749 | 1 | < 0.1% | |
| 9931 | 1 | < 0.1% | |
| 8098 | 1 | < 0.1% | |
| 7973 | 1 | < 0.1% | |
| 6339 | 1 | < 0.1% | |
| 4910 | 1 | < 0.1% |
A7
Real number (ℝ)
| Distinct count | 86 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.09231034482759 |
|---|---|
| Minimum | -48.0 |
| Maximum | 105.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | -48 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 32 |
| median | 39 |
| Q3 | 42 |
| 95-th percentile | 61 |
| Maximum | 105 |
| Range | 153 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 13.11142808 |
|---|---|
| Coefficient of variation (CV) | 0.3534810303 |
| Kurtosis | 1.62177146 |
| Mean | 37.09231034 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.3684014245 |
| Sum | 2151354 |
| Variance | 171.9095462 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 40 | 4830 | 8.3% | |
| 41 | 4145 | 7.1% | |
| 42 | 3416 | 5.9% | |
| 39 | 3250 | 5.6% | |
| 38 | 2791 | 4.8% | |
| 43 | 2653 | 4.6% | |
| 37 | 2522 | 4.3% | |
| 35 | 2067 | 3.6% | |
| 36 | 2053 | 3.5% | |
| 33 | 1687 | 2.9% | |
| 34 | 1545 | 2.7% | |
| 32 | 1372 | 2.4% | |
| 31 | 1340 | 2.3% | |
| 22 | 1328 | 2.3% | |
| 4 | 1321 | 2.3% | |
| 44 | 1202 | 2.1% | |
| 45 | 1195 | 2.1% | |
| 23 | 1132 | 2.0% | |
| 30 | 1086 | 1.9% | |
| 25 | 1064 | 1.8% | |
| 46 | 996 | 1.7% | |
| 28 | 960 | 1.7% | |
| 21 | 928 | 1.6% | |
| 29 | 882 | 1.5% | |
| 26 | 870 | 1.5% | |
| Other values (61) | 11365 | 19.6% |
| Value | Count | Frequency (%) | |
| -48 | 1 | < 0.1% | |
| -43 | 2 | < 0.1% | |
| -27 | 1 | < 0.1% | |
| -26 | 2 | < 0.1% | |
| -19 | 1 | < 0.1% | |
| -18 | 8 | < 0.1% | |
| -16 | 1 | < 0.1% | |
| -15 | 1 | < 0.1% | |
| -10 | 3 | < 0.1% | |
| -8 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 105 | 1 | < 0.1% | |
| 104 | 1 | < 0.1% | |
| 75 | 2 | < 0.1% | |
| 73 | 10 | < 0.1% | |
| 72 | 50 | 0.1% | |
| 71 | 149 | 0.3% | |
| 70 | 86 | 0.1% | |
| 69 | 518 | 0.9% | |
| 68 | 428 | 0.7% | |
| 67 | 477 | 0.8% |
| Distinct count | 123 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.88455172413793 |
|---|---|
| Minimum | -353.0 |
| Maximum | 270.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | -353 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 37 |
| median | 44 |
| Q3 | 60 |
| 95-th percentile | 94 |
| Maximum | 270 |
| Range | 623 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 21.41805059 |
|---|---|
| Coefficient of variation (CV) | 0.4209145972 |
| Kurtosis | 8.41627801 |
| Mean | 50.88455172 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.06620442 |
| Sum | 2951304 |
| Variance | 458.7328912 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 37 | 2772 | 4.8% | |
| 39 | 2703 | 4.7% | |
| 41 | 2600 | 4.5% | |
| 35 | 2408 | 4.2% | |
| 44 | 1993 | 3.4% | |
| 42 | 1988 | 3.4% | |
| 32 | 1901 | 3.3% | |
| 34 | 1689 | 2.9% | |
| 46 | 1640 | 2.8% | |
| 43 | 1525 | 2.6% | |
| 40 | 1500 | 2.6% | |
| 30 | 1446 | 2.5% | |
| 38 | 1343 | 2.3% | |
| 36 | 1337 | 2.3% | |
| 48 | 1320 | 2.3% | |
| 33 | 1254 | 2.2% | |
| 45 | 1071 | 1.8% | |
| 49 | 961 | 1.7% | |
| 50 | 941 | 1.6% | |
| 55 | 853 | 1.5% | |
| 51 | 814 | 1.4% | |
| 28 | 798 | 1.4% | |
| 29 | 786 | 1.4% | |
| 31 | 773 | 1.3% | |
| 57 | 744 | 1.3% | |
| Other values (98) | 20840 | 35.9% |
| Value | Count | Frequency (%) | |
| -353 | 2 | < 0.1% | |
| -258 | 2 | < 0.1% | |
| -191 | 1 | < 0.1% | |
| -14 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 16 | 1 | < 0.1% | |
| 20 | 1 | < 0.1% | |
| 21 | 8 | < 0.1% | |
| 22 | 49 | 0.1% | |
| 23 | 209 | 0.4% |
| Value | Count | Frequency (%) | |
| 270 | 1 | < 0.1% | |
| 269 | 1 | < 0.1% | |
| 265 | 2 | < 0.1% | |
| 240 | 1 | < 0.1% | |
| 184 | 2 | < 0.1% | |
| 131 | 3 | < 0.1% | |
| 130 | 45 | 0.1% | |
| 129 | 28 | < 0.1% | |
| 128 | 157 | 0.3% | |
| 127 | 65 | 0.1% |
| Distinct count | 77 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.932413793103448 |
|---|---|
| Minimum | -356.0 |
| Maximum | 266.0 |
| Zeros | 20134 |
| Zeros (%) | 34.7% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | -356 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 14 |
| 95-th percentile | 78 |
| Maximum | 266 |
| Range | 622 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 25.61401796 |
|---|---|
| Coefficient of variation (CV) | 1.838447978 |
| Kurtosis | 8.438483988 |
| Mean | 13.93241379 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 2.243243705 |
| Sum | 808080 |
| Variance | 656.0779162 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 0 | 20134 | 34.7% | |
| 2 | 14758 | 25.4% | |
| 6 | 1775 | 3.1% | |
| 8 | 1731 | 3.0% | |
| 4 | 1694 | 2.9% | |
| 14 | 1540 | 2.7% | |
| 12 | 1255 | 2.2% | |
| 16 | 1120 | 1.9% | |
| 10 | 881 | 1.5% | |
| 32 | 819 | 1.4% | |
| 34 | 730 | 1.3% | |
| 18 | 707 | 1.2% | |
| 22 | 635 | 1.1% | |
| 24 | 595 | 1.0% | |
| 30 | 592 | 1.0% | |
| 40 | 555 | 1.0% | |
| 26 | 473 | 0.8% | |
| 28 | 469 | 0.8% | |
| 36 | 458 | 0.8% | |
| 42 | 415 | 0.7% | |
| 56 | 319 | 0.5% | |
| 20 | 318 | 0.5% | |
| 46 | 294 | 0.5% | |
| 58 | 289 | 0.5% | |
| 60 | 278 | 0.5% | |
| Other values (52) | 5166 | 8.9% |
| Value | Count | Frequency (%) | |
| -356 | 2 | < 0.1% | |
| -298 | 2 | < 0.1% | |
| -264 | 1 | < 0.1% | |
| -18 | 1 | < 0.1% | |
| -14 | 3 | < 0.1% | |
| -12 | 2 | < 0.1% | |
| -2 | 1 | < 0.1% | |
| 0 | 20134 | 34.7% | |
| 2 | 14758 | 25.4% | |
| 4 | 1694 | 2.9% |
| Value | Count | Frequency (%) | |
| 266 | 1 | < 0.1% | |
| 244 | 1 | < 0.1% | |
| 242 | 1 | < 0.1% | |
| 226 | 1 | < 0.1% | |
| 196 | 1 | < 0.1% | |
| 180 | 2 | < 0.1% | |
| 126 | 56 | 0.1% | |
| 124 | 182 | 0.3% | |
| 122 | 176 | 0.3% | |
| 120 | 212 | 0.4% |
target
Real number (ℝ≥0)
| Distinct count | 7 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.6947758620689655 |
|---|---|
| Minimum | 1 |
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 453.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 5 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.350960336 |
|---|---|
| Coefficient of variation (CV) | 0.7971321558 |
| Kurtosis | 0.448393212 |
| Mean | 1.694775862 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.502523229 |
| Sum | 98297 |
| Variance | 1.825093831 |
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 1 | 45586 | 78.6% | |
| 4 | 8903 | 15.3% | |
| 5 | 3267 | 5.6% | |
| 3 | 171 | 0.3% | |
| 2 | 50 | 0.1% | |
| 7 | 13 | < 0.1% | |
| 6 | 10 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1 | 45586 | 78.6% | |
| 2 | 50 | 0.1% | |
| 3 | 171 | 0.3% | |
| 4 | 8903 | 15.3% | |
| 5 | 3267 | 5.6% | |
| 6 | 10 | < 0.1% | |
| 7 | 13 | < 0.1% |
| Value | Count | Frequency (%) | |
| 7 | 13 | < 0.1% | |
| 6 | 10 | < 0.1% | |
| 5 | 3267 | 5.6% | |
| 4 | 8903 | 15.3% | |
| 3 | 171 | 0.3% | |
| 2 | 50 | 0.1% | |
| 1 | 45586 | 78.6% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| A1 | A2 | A3 | A4 | A5 | A6 | A7 | A8 | A9 | target | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 50.0 | 21.0 | 77.0 | 0.0 | 28.0 | 0.0 | 27.0 | 48.0 | 22.0 | 2 |
| 1 | 55.0 | 0.0 | 92.0 | 0.0 | 0.0 | 26.0 | 36.0 | 92.0 | 56.0 | 4 |
| 2 | 53.0 | 0.0 | 82.0 | 0.0 | 52.0 | -5.0 | 29.0 | 30.0 | 2.0 | 1 |
| 3 | 37.0 | 0.0 | 76.0 | 0.0 | 28.0 | 18.0 | 40.0 | 48.0 | 8.0 | 1 |
| 4 | 37.0 | 0.0 | 79.0 | 0.0 | 34.0 | -26.0 | 43.0 | 46.0 | 2.0 | 1 |
| 5 | 85.0 | 0.0 | 88.0 | -4.0 | 6.0 | 1.0 | 3.0 | 83.0 | 80.0 | 5 |
| 6 | 56.0 | 0.0 | 81.0 | 0.0 | -4.0 | 11.0 | 25.0 | 86.0 | 62.0 | 4 |
| 7 | 55.0 | -1.0 | 95.0 | -3.0 | 54.0 | -4.0 | 40.0 | 41.0 | 2.0 | 1 |
| 8 | 53.0 | 8.0 | 77.0 | 0.0 | 28.0 | 0.0 | 23.0 | 48.0 | 24.0 | 4 |
| 9 | 37.0 | 0.0 | 101.0 | -7.0 | 28.0 | 0.0 | 64.0 | 73.0 | 8.0 | 1 |
Last rows
| A1 | A2 | A3 | A4 | A5 | A6 | A7 | A8 | A9 | target | |
|---|---|---|---|---|---|---|---|---|---|---|
| 57990 | 38.0 | 2.0 | 79.0 | 0.0 | 38.0 | 18.0 | 42.0 | 41.0 | 0.0 | 1 |
| 57991 | 101.0 | 0.0 | 102.0 | 0.0 | 70.0 | -3.0 | 1.0 | 33.0 | 32.0 | 5 |
| 57992 | 39.0 | -2.0 | 80.0 | -4.0 | 38.0 | 0.0 | 41.0 | 41.0 | 0.0 | 1 |
| 57993 | 43.0 | 0.0 | 81.0 | 1.0 | 42.0 | -9.0 | 37.0 | 39.0 | 2.0 | 1 |
| 57994 | 49.0 | 0.0 | 87.0 | 0.0 | 46.0 | -12.0 | 38.0 | 41.0 | 2.0 | 1 |
| 57995 | 80.0 | 0.0 | 84.0 | 0.0 | -36.0 | -29.0 | 4.0 | 120.0 | 116.0 | 5 |
| 57996 | 55.0 | 0.0 | 81.0 | 0.0 | -20.0 | 25.0 | 26.0 | 102.0 | 76.0 | 4 |
| 57997 | 55.0 | 0.0 | 77.0 | 0.0 | 12.0 | -22.0 | 22.0 | 65.0 | 42.0 | 4 |
| 57998 | 37.0 | 0.0 | 103.0 | 0.0 | 18.0 | -16.0 | 66.0 | 85.0 | 20.0 | 1 |
| 57999 | 56.0 | 2.0 | 98.0 | 0.0 | 52.0 | 1.0 | 42.0 | 46.0 | 4.0 | 4 |